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Field Of The Invention 



The invention relates generally to the field of digital data processing systems, and more 
particularly to a system and method for assessing the operational effectiveness of a cache memory in 
a digital data processing system. The cache memory operational effectiveness assessment can be used 
to determine whether increasing the size of a cache memory v^ould provide any significant increase 
in processing efiiciency by the digital data processing system, as well as whether any significant 
, decrease in processing efiiciency might occur if the size of the cache memory is decreased. 

Background Of The Invention 

Digital data processing systems include one or more processors for performing processing 
operations in connection v^th information stored in a memory. Typically, a memory in a modem 
digital data processing system consists of a hierarchy of storage elements, extending fi*om large- 
capacity but relatively slow storage elements and various levels of lower-capacity and relatively fast 
storage devices. The large-capacity and relatively slow devices include such types of devices as disk 
or tape storage devices which store information on a magnetic medium; such devices are relatively 
inexpensive on a storage cost per unit of storage basis. Intermediate in the hierarchy, both in terms 
of speed and storage capacity are random-access memories, which are somewhat faster than the disk 
or tape devices, but which are also more expensive on a storage cost per unit of storage basis. At the 
fastest end of the hierarchy are cache memories, which are also the most expensive and thus generally 
the smallest. 

Generally, during processing operations, a processor will enable information to be processed 
to be copied fi-om the slower devices to the increasingly faster devices for faster retrieval. Generally, 
transfers between, for example, disk devices and random-access memories are in relatively large 
blocks, and transfers between the random-access memories and cache memories are in somewhat 
smaller "cache lines." In both cases, information is copies to the random-access memory and cache 
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1 memory on an "as needed" basis, that is, when the processor determines that it needs particular 

2 information in its processing, it will enable blocks or cache lines which contain information to be 

3 copied to the respective next faster information storage level in the memory hierarchy. Certain 

4 prediction methodologies have been developed to attempt to predict the whether a processor will 

5 need information for processing before it (that is, the processor) actually needs the information, and 

6 to enable the information to be copied to the respective next faster information storage level. 

7 However, generally at some point in the processing operations, the processor will determine that 

8 information required for processing is not available in the faster information storage level, that is, a 

9 "read miss" will occur, and it (that is, the processor) will need to delay its processing operations until 
10 the information is available. Generally, the rate at which read misses will occur with storage 

3 

gl 1 element(s) at a particular level in the hierarchy will be related to the storage capacity of the storage 

■^fl2- element(s) at the particular level, as well as the pattern with which the processor accesses the 

y 

J13 information in the respective storage level In any case, to enhance the processing efficiency of a 

iLl4 digital data processing system, it is generally helpful to be able to assess the effect of changing the 

^"15 capacity of the memory element(s) at a particular level in the memory hierarchy on the rate of read 

316 misses at the particular level 

J 

a 7 Summary Of The Invention 

18 The invention provides a new and improved system and method for providing a prediction of 

19 the operational effectiveness of a cache memory of a particular size in a digital data processing 

20 system. The invention facilitates the efficient determination of the likely effectiveness of the cache 

21 memory for various cache memory sizes, based on a prediction of the likely cache miss rate, the 

22 prediction being based on operational statistics which are gathered during actual use of the cache 

23 memory over one or more time periods, and based on a variety of cache management methodologies. 

24 Based on the prediction, the operator or the system can facilitate increasing or decreasing the size of 

25 the cache memory, or maintaining the cache memory at its then-current size. 
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1 The system determines the cache memory's read miss rate from statistics that are collected 

2 during use of the cache memory over an arbitrary time interval, including statistics concerning the file 

3 information retrieval activity and the extent of activity per unit time for system. Based on the 

4 statistics, equations, which are based on the respective cache memory management methodology, 

5 including the FIFO (first-in/first-out) methodology or the LRU (least-recently used) methodology, 

6 used in managing the cache memory, are solved to generate a prediction of the cache miss rate for 

7 a particular cache memory size, v^hich may be larger or smaller than the current cache memory size, 

8 , and for the particular cache memory management methodology. The system can repeat this a number 

9 of times over respective time intervals to determine corresponding predictions based on the cache 
10 memory utilization for respective sets of statistics determined during each time interval. Thereafter, 

Sgl 1 the system or an operator can effect a change in the cache memory size based on the cache miss rate 

f'=!l2- predictions. 

I J 

1*13 Brief Description Of The Drawings 

if 

^ il4 This invention is pointed out with particularity in the appended claims. The above and fijrther 

1315 advantages of this invention may be better understood by referring to the foUov^ing description taken 

L il6 in conjunction v^ith the accompanying drav^ngs, in which: 

17 FIG. 1 is a functional diagram of a system for assessing the operational effectiveness of a 

18 cache memory in a digital data processing system; 

19 FIG. 2 is a functional block diagram of an illustrative digital data processing system with 

20 which the cache assessment system depicted in FIG. 1 can be used; and 

21 FIG. 3 is a flow diagram illustrating operations performed by the cache assessment system 

22 depicted in FIG. 1. 

23 Detailed Descmption of an Illustrative Embodiment 
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The invention provides a new system 10 for generating an assessment as to the operational 
effectiveness of a cache memory operating in a digital data processing system. In an illustrative 
embodiment, the system 10 includes a suitably programmed digital computer system, v^hich will be 
described in connection with FIG. 1. The computer system performs a number of processing 
operations, as will be described below in connection with the flow chart depicted in FIG. 3, in 
connection with operational statistics which are generated during operations in connection with a 
respective cache memory to generate the operational effectiveness assessment. An illustrative digital 
data processing system including a cache memory for which the cache assessment generating system 
10 generates an operational assessment will be described in connection v^th FIG. 2. 

With initial reference to FIG. 1, the cache assessment system 10 in one embodiment includes 
digital computer system including a processor module 1 1 and operator interface elements comprising 
operator input components such as a keyboard 12A and/or a mouse 12B (generally identified as 
operator input element(s) 12) and an operator output element such as a video display device 13, The 
illustrative computer system is of the conventional stored-program computer architecture. The 
processor module 1 1 includes, for example, processor, memory and mass storage devices such as disk 
and/or tape storage elements (not separately shown) which perform processing and storage 
operations in connection with distal data provided thereto. In addition, the processor module 1 1 can 
include one or more network ports which are connected to communication links which connect the 
computer system in a computer network. The network ports enable the computer system to transmit 
information to, and receive information firom, other computer systems and other devices in the 
network. The operator input element(s) 12 are provided to permit an operator to input information 
for processing. The video display device 13 is provided to display output information generated by 
the processor module 1 1 on a screen to the operator, including data that the operator may input for 
processing, information that the operator may input to control processing, as well as information 
generated during processing. The processor module 1 1 generate information for display by the video 
display device 13, in one embodiment using a so-called "graphical user interface" ("GUI"). Although 
the computer system is shown as comprising particular components, such as the keyboard 12A and 
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1 mouse 12B for receiving input information from an operator, and a video display device 13 for 

2 displaying output information to the operator, it will be appreciated that the computer system may 

3 include a variety of components in addition to or instead of those depicted in FIG. 1 . 

4 As noted above, the cache assessment system 10 constructed in accordance with the invention 

5 generates an assessment as to the operational effectiveness of a cache memory in a digital data 

6 processing system. An illustrative such digital data processing system 14 is depicted in FIG. 2. With 

7 reference to FIG. 2, digital data processing system 14 includes a plurality of host computers 15(1) 

8 ' through 15(N) (generally identified by reference numeral 15(n)) and a digital data storage subsystem 

9 ^16 interconnected by a common bus 17. Each host computer 15(n) may comprise, for example, a 
C 310 personal computer, workstation, or the like which may be used by a single operator, or a multi-user 
Igll computer system which may be used by a number of operators. Each host computer 15(n) is 

Hl2 connected to an associated host adapter 24(n), which, in turn, is connected to bus 17. Each host 

|y 

^.Sl3 computer 15(n) may control its associated host adapter 24(n) to perform a retrieval operation, in 

1 3 14 which the host adapter 24(n) initiates retrieval of computer programs and digital data (generally, 

lJ5 "information") from the digital data storage subsystem 16 for use by the host computer 15(n) in its 

^^216 processing operations. In addition, the host computer 15(n) may control its associated host adapter 

,' 217 24(n) to perform a storage operation in which the host adapter 24(n) initiates storage of processed 

18 data in the digital data storage subsystem 16. Generally, retrieval operations and storage operations 

19 in connection with the digital data storage subsystem 16 will collectively be referred to as "access 

20 operations." 

21 In connection with both retrieval and storage operations, the host adapter 1 5(n) will transfer 

22 access operation command information, together with processed data to be stored during a storage 

23 operation, over the bus 17. Access to the bus 1 7 is controlled by bus access control circuitry which, 

24 in one embodiment, is integrated in the respective host adapters 24(n). The bus access control 

25 circuitry arbitrates among devices connected to the bus 17 which require access to the bus 17. In 

26 controlling access to the bus 17, the bus access control circuitry may use any of a number of known 

27 bus access arbitration techniques. 
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The digital data storage subsystem 16 in one embodiment is generally similar to the digital 
data storage subsystem described in U. S, Patent No. 5,206,939, entitled System And Method For 
Disk Mapping And Data Retrieval, issued April 27, 1993 to Moshe Yanai, et al (hereinafter, "the '939 
patent"). As shown in FIG. 2, the distal data storage subsystem 16 includes a plurality of digital data 
stores 20(1) through 20(M) (generally identified by reference numeral 20(m)), each of which is also 
connected to bus 17. Each of the data stores 20(m) stores information, including programs and data, 
which may be accessed by the host computers 15(n) for processing, as well as processed data 
. provided to the digital data storage subsystem 16 by the host computers 15(n). 

Each data store 20(m), in turn, includes a storage controller 21(m) and one or more storage 
devices generally identified by reference numeral 22. The storage devices 22 may comprise any of 
the conventional magnetic disk and tape storage devices, as well as optical disk storage devices and 
CD-ROM devices from which information may be retrieved. Each storage controller 21(m) connects 
to bus 17 and controls the storage of information which it receives thereover in the storage devices 
connected thereto. In addition, each storage controller 21(m) controls the retrieval of information 
from the storage devices 22 which are connected thereto for transmission over bus 17, and in one 
embodiment includes bus access control circuitry for controlling access to bus 17. 

The digital data storage subsystem 16 also includes a common memory subsystem 30 for 
caching information during an access operation and event status information providing selected status 
information concerning the status of the host computers 15(n) and the data stores 20(m) at certain 
points in their operations. The caching of event status information by the common memory 
subsystem 30 is described in detail in U. S. Patent Appn. Ser. No. 08/532,240 filed September 25, 
1995, in the name of Eli Shagam, et al., and entitled Digital Computer System Including Common 
Event Log For Logging Event Information Generated By A Plurality of Devices (Atty. Docket No. 
95-034) assigned to the assignee of the present invention and incorporated herein by reference. The 
information cached by the common memory subsystem 30 during an access operation includes data 
provided by a host computer 15(n) to be stored on a data store 20(m) during a storage operation, as 
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1 well as data provided by a data store 20(m) to be retrieved by a host computer 15(n) during a 

2 retrieval operation. 

3 The common memory subsystem 30 effectively operates as a cache to cache information 

4 transferred between the host computers 15(n) and the data stores 20(m) during an access operation. 

5 The common memory subsystem 30 includes a cache memory 3 1, a cache index directory 32 and a 

6 cache manager 33, which are generally described in U. S. Pat, Appn. Ser. No. 07/893,509 filed June 

7 4, 1995, in the name of Moshe Yanai, et al., entitled "System And Method For Dynamically 

8 * Controlling Cache Management," and U. S. Pat. Appn. Ser. No. , filed September 2, 

9 , 1995, in the name of Eli Shagam, and entitled Average Flow-Through Time In Cache (Atty. Docket 
|3lO No. 95-032) (hereinafter referred to as the "Shagam application"), both of which are assigned to the 
Ipl assignee of the present invention and incorporated herein by reference. The cache memory 31 
iJl2 operates as a buffer in connection with storage and retrieval operations, in particular buffering 
.^S13 information received fi"om the data stores 20(m) requested by the host computers 15(n) for 
1^14 processing, as well as information received from the host computers 15(n) to be transferred to the 
^15 storage devices for storage. 

□ 16 In operation, when a host computer 15(n) wishes to retrieve information from the storage 

;'n7 subsystem 16, it initially enables its host adapter 24(n), in particular a cache manager 25(n) associated 

Hi 18 with the host adapter 24(n), to determine whether the mformation to be retrieved is in the cache 

19 memory 3 1 . If the information to be retrieved is in the cache memory 31, that is, if a "read hit" occurs 

20 in connection with the cache memory 3 1, the cache manager 25(n) will retrieve the information from 

21 the cache memory 31 and transfer it to the host computer 15(n). On the other hand, if the 

22 information to be retrieved is not in the cache memory 31, that is, if a "read miss" occurs in 

23 connection with the cache memory 31, the cache manager 25(n) will enable a cache manager 23(m) 

24 associated with the data store 20(m) whose storage device 22 which contains the information to be 

25 retrieved to perform a "staging operation" to transfer a portion of a file containing the information 

26 to be retrieved from the storage device 22 to the cache memory 3 1 . After the portion of the file has 

27 been transferred from the storage device 22 to the cache memory 3 1 during the staging operation. 
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1 the data store's cache manager 23(m) will notify the host adapter's cache manager 25(n), after which 

2 the host adapter's cache manager 25(n) can retrieve the information from the cache memory 3 1 . 

3 Similar operations are performed in connection with a storage operation, in which information 

4 in a file is updated. In particular, during a storage operation, the host adapter 24(n) will enable its 

5 cache manager 25(n) to initially determine whether at least the portion of the file to be updated is in 

6 the cache memory 3 L If the portion of the file to be updated is in the cache memory, that is, if a 

7 "write hit" occurs in connection with the cache memory 3 1, the cache manager 25(n) will store the 

8 updated information in the cache memory 31. On the other hand, if the cache manager 25(n) 

9 , determines that the portion of the file to be updated is not in the cache memory 3 1 , that is, if a "write 
£ 4 0 miss" occurs in connection with the cache memory 3 1 , the cache manager 25(n) will enable the cache 

1^ manager 23(m) associated with the data store 20(m) whose storage device 22 which contains the 

|ijl2 portion of the file to be updated to perform a "staging operation" to transfer the data fix>m the storage 

51 3 device 22 to the cache memory 3 1 . After the portion of the file has been transferred from the storage 

|;J4 device 22 to the cache memory 3 1 during the staging operation, the data store's cache manager 23(m) 

r. 1 5 will notify the host adapter's cache manager 25(nX after which the host adapter's cache manager 25(n) 

C3 

l§.6 can store the updated information in the cache memory 3 1 . 

^ It will be appreciated that the efficiency of the digital data processing system 14 will generally 

h|18 be enhanced if the rate at which, in particular, read misses in connection with cache memory 3 1 can 

19 be reduced. The rate at which read misses occur is of importance, relative to write misses, since read 

20 misses can slow down the rate at which the host computers 15(n) will be able to obtain information 

21 for processing, whereas write misses will just slow down the rate at which updated information is 

22 stored in the storage subsystem 16. The cache assessment system 10 provides an assessment as to 

23 the effectiveness of the common memory subsystem 30 in operating as a cache, and in particular 

24 generates an assessment as to the changes in read misses which may occur if the cache memory 3 1 

25 is increased or decreased in size, 

26 The cache assessment system 10 determines the cache memory's read miss rate from statistics 

27 that are collected during operation of the digital data processing system 14, including statistics 
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concerning the file information retrieval activity and the extent of activity per unit time for digital data 
processing system 14. The file access activity, which will be represented by A^, measures the number 
of times the host computers 15(n) issued requests to retrieve information from the particular "i-th" 
file, including those retrieval requests for which the information was already in the cache memory 3 1 
and those retrieval requests for which staging operations were required to transfer the information 
fi"om the storage device 22 to the cache memory 3 1 . The extent of activity, which will be represented 
by Ei, refers to the amount of the respective "i-th" file which is active. Both statistics and Ej are 
gathered over an arbitrary time interval, and represent the activity and extent of activity over the 
particular time interval. Assuming that 

(a) the activity is spread uniformly over a particular extent, which can occur if the host 
computers 15(n) randomly issue retrieval requests for information fi^om the respective extent; 

(b) the digital data processing system 14 caches all read misses in cache memory 31, that is, 
during sta^g operations in connection with information fi*om a file follovdng determinations by the 
respective host adapters 24(n) that the information was not already in the cache memory 31; and 

(c) the cache memory 31 is managed on a first-in first-out (FIFO) basis, that is, information 
is removed fi"om the cache in the order in which it is loaded into the cache (generally, if the cache 
memory 3 1 is large, other cache management methodologies, such as the "least-recently used" 
methodology, will approximate FIFO management) 

then, if Pj represents the percentage of the cache memory 3 1 which is occupied by information fi*om 
a file "i," at each point in time the sum of the percentages of all of the files "i" that are stored in the 
storage devices 22 of storage subsystem 16 which are cached in the cache memory 3 1 equals one, that 
is, 



i 



(1) 
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Based on assumptions (b) and (c) above, that is, based on the assumption that the digital data 
processing system 14 caches all read misses in the cache memory 31, stores the staged information 
retrieved from a file stored on a storage device caches all of the read misses in the cache memory 3 1, 
and based further on the assumption that the cache memory 3 1 is managed on a FIFO basis, then 
the percentage of the "i-th" file that is cached in the cache memory 31 at any point in time 
corresponds to the ratio between the number of read misses in connection with the "i-th" file and the 
total number of read misses, that is, 



Pi = — 
' M 



(2), 



where represents the number of read misses in connection with file "i" per unit time and "M" 
represents the total number of read misses which occur in the digital data processing system 14 per 
unit time. At any point in time, the portion of the "i-th" file which is cached in the cache memory 3 1 
PS 

corresponds to _ L. and so the portion of the file which is not cached in the cache memory 31 

E. 



PS 

corresponds to i - Based on assumption (a) above, that is, that activity rate in connection 



with a file is spread uniformly over the file, the number of read misses in connection with file "i" per 
unit time, M^, in turn, is related to the portion of the file which is not in the cache memory 3 1 at any 
point in time, times the activity rate A^, or 



M, = 



1 " 



P.S\ 



(3). 
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1 Combining equations (2) and (3), the multiplicative product of the Pj, the percentage of the 

2 cache memory 3 1 which is occupied by information from the "i-th" file over each time interval, times 

3 the total number of read misses over the time interval corresponds to 



P^M = M. = 



(4). 



5 Rearranging equation (4), 



P.M 



P — - - A, 



(5) 



and solving equation (5) for Pj, 

A. 



9 



Combining equation (6) and equation (1), 
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1 = E^, = E 



M + — - 

E.. 



(7) 



which is a fiinction of "M," the total number of read misses per time interval, the size "S" of the cache 
and the statistics and which are generated as described above. 

Equation (7) effectively gives a prediction as to the number of read misses in connection with 
cache memory 31 as a function of the size "S" of the cache memory 3 1, for the particular types of 
processing operations which are performed by the digital data processing system 14 during the time 
interval over which the statistics and Ej were collected. Accordingly, for any particular size "S" 
as selected by the operator of the cache assessment system 10, equation (7) can be solved for "M" 
to provide a prediction as to the effect on the number of read misses which would occur for the 
particular cache memory size S. That is, for any particular value of "S," the size of the cache memory 
31, the number of read misses can be predicted by providing that value in equation (7) and solving 
for "M." 

Equation (7) can be solved for "M" in a number of ways. Equation (7) can be efficiently 
solved for "M" using a binary search arrangement, as will be evident from the following. First, it will 
be recognized that, generally, the sum in the right-hand side of equation (7) is a function of "M" and 
"S," that is. 



96-108 

-13- 

1 is monotonically decreasing for positive values of "M" and "S." Differentiating equation (8) with 

2 respect to "M," 



f'{M,S) = E 7 



-A 



SA\ 
M + — '- 



(9) 



4 where f (M,S) represents the derivative of f(M,S) with respect to M. Since Aj, Ej, M and S are all 

5 positive numbers, it will be appreciated that the derivative is negative for all positive values of "M" 

6 and "S", and so f(M,S) as shown in equation (8) is monotonically decreasing for all values of "S." 

7 Thus, for each value of "S" there will be one real value for "M." In addition, it v^l be recognized that 

8 the value of "M" will fall in the interval 

9 o<m<j:^^. =^ .... 



10 (where "A" represents the total activity over the tune interval), since the number of read misses in a 

1 1 time interval cannot be negative and cannot exceed the total activity over the time interval Any 

12 conventional methodology can be used to determine or approximate the solution to equation (7), 

13 including, for example, a conventional binary search methodology over the interval defined by 

14 equation (10). 

15 Using equations (7) and (10), an operator of the cache assessment system 10 can assess the 

16 effectiveness, if any, of increasing or decreasing the size of the cache memory 3 1 fi:*om its current size, 

17 based on the particular types of processing activities performed during the time interval over which 
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1 the statistics Aj and were collected. Using equations (7) and (10) over a plurality of time intervals, 

2 during which the mixture of types of processing operations will likely change, the operator can 

3 determine the effectiveness, if any, of increasing or decreasing the size of the cache memory 3 1 for 

4 the diverse mixtures of processing operations which the digital data processing apparatus 14 may be 

5 called upon to perform. 

6 As noted above, the model described in connection with equations (1) through (9) 

7 encompasses assumptions (a) and (b) above, namely, that the activity in connection with an accessed 

8 file is spread uniformly over the respective file (assumption (a)), and that the digital data processing 

9 system 14 caches all read misses in cache memory 3 1 (assumption (b)). For at least information to 

10 be processed (in contrast to program instructions which may be stored in the storage subsystem 16 

1 1 and retrieved by the host computers 15(n)), generally assumption (b) is correct, that is, generally such 

12 information retrieved fi-om the storage devices in connection with a staging output following a read 

13 miss will be cached in the cache memory 3 1 . However, activity in connection with an accessed file 

14 does not need to be uniformly distributed uniformly over the entire file, and so assumption (a) may 

15 not be correct. 

16 To extend the methodology to a model in which assumption (a) is eliminated, it will be 

17 assumed that each file may comprise one or more relatively short, non-overlapping, and possibly 

18 variable-length "packets." When the an access request is issued to particular packet is referenced, 

19 the packet may be re-referenced within a relatively short period of time. As an illustration which will 

20 assist in clarifying these assumptions, in a file containing banking information for customers of a bank, 

21 each of the packets may comprise information concerning a particular account or customer. In that 

22 case, the packets may have differing lengths, based on the amount of banking the customer has done 

23 with the particular bank. In addition, typically the packets will be accessed when, for example, the 

24 customer's account is updated by the bank or when the customer calls for information concerning the 

25 account, and will otherwise not be accessed. In that case, when a packet is accessed, a number of 

26 references may be made to the packet within a short time, that is, while the bank is performing the 

27 update operation or during the call. In this extended methodology, the activity and extent statistics 
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Aj and Ej , are still determined on a file basis; however, a further statistic is generated, namely, a 
reference average Rj , which indicates the average number of times a packet is referenced during each 
time interval, that is, the number of times a host computer 15(n) retrieves information firom the 
packet. (A methodology for determining the value of will be described below.) It will be 
appreciated that the reference average R; includes the initial reference, which would give rise to a 
cache miss. Under these assumptions, equation (4) becomes 



P.M = M. = 



i P.S\ 
1 - 



(11) 



where H(i,S) corresponds to the number of cache hits per packet. (A methodology for determining 
the value of H(i,S) will be described below,) In equation 1 1, 



(a) the term ' 



1 - 



as in equation (4), is an indication of the amount of the amount 



of the "i-th" file that is not in the cache, and so 



-J. is an indication of the number of 

R. 



cache misses on the first access request in connection with a packet; and 

(b) the term " (i? . - 1 - H(i,S) ) — " is an indication of the number of cache misses on 



subsequent references in connection with a packet. 



Rearranging equation (11) in the same manner as equation (4) above. 
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P.. 



M + 



A_iZ. 



- l-H(i,S)) 



(12) 



and solving equation (12) for , 



CO 3 



XV, 



M + 



^ — 



(13) 



4 Since, as above in connection with equation (1), it is assumed that the cache memory 31 is My 

5 populated with information from the data stores 20(m), 



1 = Ep< = 



-^(R, -1 -H(i,S)) 
M + 



(14), 
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which, as in equation (7), provides the cache miss ratio "M" as a function of the cache size "S" and 
the statistics developed over the time interval. 

Using a methodology similar to that described above in connection with equation (7), equation 
14 can be solved in the following manner. From the following, it will be clear that equaiton (14) 
provides one real solution of "M" for each value of "S." First, it vwU be recognized that, generally, 
the sum in the right-hand side of equation (14) is a function of "M" and "S," that is. 



is monotonically decreasing for positive values of "M" and "S." Differentiating equation (15) with 
respect to "M," 



-i(i?, -1 -H(i,S)) 



f(M,S) = D 




(15). 



--^(R. -1 -H(i,S)) 



f\M,S) = D 



(A 



2 



(16) 



M + 
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where f (M,S) represents the derivative of f(M,S) with respect to M. Since Aj, Ej, M and S are all 
positive numbers, it will be appreciated that the derivative is negative for all positive values of "M" 
and "S", and so f(M,S) as shown in equation (8) is monotonically decreasing for all values of "S." 
Thus, for each value of "S" there will be one real value for "M." In addition, it will be recognized that 
the value of "M" will fall in the interval 

0<M<Y:A,=A (17) 



(where "A" represents the totd activity over the time interval), since the number of read misses in a 
time interval cannot be negative and cannot exceed the total activity over the time interval. Any 
conventional methodology can be used to determine or approximate the solution to equation (14), 
including, for example, a conventional binary search methodology over the interval defined by 
equation (17). 

Using equations (14) and (17), an operator of the cache assessment system 10 can assess the 
eflFectiveness, if any, of increasing or decreasing the size of the cache memory 3 1 fi-om its current size, 
based on the particular types of processing activities performed during the time interval over which 
the statistics A^ and Ei were collected. Using equations (14) and (17) over a plurality of time 
intervals, during which the mixture of types of processing operations will likely change, the operator 
can determine the effectiveness, if any, of increasing or decreasing the size of the cache memory 3 1 
for the diverse mixtures of processing operations which the digital data processing apparatus 14 may 
be called upon to perform. 

As noted above, the methodologies described above in connection with equations 1 through 
10 and 11 through 17 both assume that the cache memory 31 is operated using a FIFO cache 
operating methodology (assumption (c) above). While the FIFO methodology can be useful in 
approjdmating conditions in cache memories which use other methodologies, particualrly if the cache 
memories are relatively large, a more accurate methodology for use with a cache memory which is 
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operated using the "LRU" (least recently used) cache management methodology, will be described 
in connection with equations (18) through (21) below. As is usual in the LRU cache management 
methodology, when a cache miss occurs and information is staged into the cache memory 31, the 
information that is replaced during the staging operation is the information that was least recently 
accessed by, for example, a host adapter 24(n). To maintain the LRU ordering, when information 
in the cache memory 31 is accessed, the accessed information's position in the LRU ordering is 
promoted to the top of the LRU ordering. Thus, since a time interval "I," there are M/I cache slots 
in the cache memory 3 1 replaced duimg the time interval, the amouQt of time that an extent will 
remain in the cache memory 3 1 after the last time the extent was referenced v^U correspond to the 
total size of the cache memory 31, "S," divided by the number of cache slots that are replaced duimg 
a time interval "M/I. " That is, the amount of time that an extent will remain i the cache memory 3 1 
after it was last reference corresponds to S/(M/I). Accordingly, if the the time interval between the 
first time the extent is accessed and the last time the extent is accessed is "Tj, then the total time that 

the extent will remain in the cache memory 3 1 is 7 + ^ = t + Thus, the percentage of 



M) 
I 



M 



the "i-th" file in the cache memory 31 at any point in time (reference equation (2)) above corresponds 
to 



P. = 



' m] (18). 

SI 



If the cache memory 3 1 is large, then 
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1 - 



E. 
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analogous to equations (3) and (1 1). From equations (1), (18) and (19) 



1 - Tp< = E 



^ ps^ 
1 - — 

E. 



R.[ ' M 



(20). 



SI 



Rearranging equaiton (20), 



1 = E^.= E 



A T 



R 



(A SI 



R 



' SA T\ 
SI + 



^R . 



Al\ 



(21), 



which has a form similar to equations (7) and (14), and can be solved in a similar manner using a 
binary search technique. Using equations (14) and (17), an operator of the cache assessment system 
10 can assess the effectiveness, if any, of increasing or decreasing the size of the cache memory 31 



96-108 

-21- 

1 from its current size, based on the particular types of processing activities performed during the time 

2 interval over which the statistics and were collected. Thus, using equation (21), along with 

3 equations (10) and (17) (which define the effective range for "M," the number of cache misses) over 

4 a plurality of time intervals, during which the mixture of types of processing operations will likely 

5 change, the operator can determine the effectiveness, if any, of increasing or decreasing the size of 

6 the cache memory 31 for the diverse mixtures of processing operations which the digital data 

7 processing apparatus 14 may be called upon to perform. 

8 As noted above, the second and third methodologies, described above in connection with 

9 equations 1 1 through 17 and 18 through 21, respectively, require values for variable R^, H(i,S) and 
1310 to be determined over respective time intervals. Such values can be determined as follows. The 
1^1 1 value of Tj, the times between the first and last access to a cached extent in the cache memory 3 1, can 
I 12 be determined by providing a time stamp for each cache slot, which includes an initial value when an 
,.513 extent is assigned to the cache slot, and an access value that is updated each time the cache slot is 
fil4 accessed. In that case, when the cache slot is re-used for another extent, the value of Tj for the 
IJ^S pre\dous extent vM correspond to the difference between the access value and the initial value. 

Ql6 The reference average Rj , which indicates the average number of times a packet in a cache 

^'.^17 slot is accessed during each time interval, and H(i,S), the number of cache hits per packet, can be 

J 18 determined by establishing respective tables having an entry associated with each packet, which are 

19 incremented when respective packet is accessed. When the cache slot is re-used for another extent, 

20 the number of times the extent was accessed while the packet was in the cache slot can be 

21 determined. The respective tables can, for example, have one entry for each extent, or, alternatively, 

22 may be in the form of a hash table having a plurality of entries which are hashed to identifiers for the 

23 extents in a conventional manner. 

24 With this background operations performed by the cache assessment system 10 will be 

25 described in connection with the flowchart in FIG. 3. With reference to FIG. 3, the cache assessment 

26 system 10 initially collects the operational statistics during operation of the digital data processing 

27 system 14 over a selected time period (step 100). If the cache assessment system 10 uses the first 
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1 model, described above in connection with equations 1 through 10, it need only collect the activity 

2 and extent statistics and described above. On the other hand, if the cache assessment system 

3 10 uses the second model, it will additionally need to collect the re-reference and cache hit statistics 

4 Ri and H(i,S) over the time interval 

5 After the time interval has terminated, the cache assessment system 10 will apply the statistics 

6 to the appropriate equation (8) or (14), depending on the selected model, for various sizes "S" of 

7 cache memory 3 1 to generate respective predictions as to the number of read misses for each of the 

8 respective cache sizes "S" (step 101), Based on the respective predictions, the cache assessment 

9 system 10 or the operator can determine whether to adjust the size of the cache memory 3 1 (step 
plO 102) and, if so, initiate operations to enable the size adjustment to occur. 

K Ol 1 The invention provides a number of advantages. In particular, the invention provides a system 

I J 12 and method for assessing the effect of read misses in connection with a cache memory used in a 

^Tl3 digital data proces^ system, as a function of the size of the cache memory, using statistics that are 

1-^14 generated during processing operations of the digital data processing system. Based on the 

C 3 1 5 assessment, a determination can be made as to the utility of providing a cache memory of a particular 

i; 3 1 6 size in the digital data processing system. 

y 17 It will be appreciated that a number of modifications may be made to the cache assessment 

18 system 10 described above. For example, although the system 10 has been described in connection 

19 with a particular digital data processing system, it will be appreciated that the system 10 will find 

20 utility in connection with digital data processing systems of numerous architectures. 

21 It will be appreciated that a system in accordance with the invention can be constructed in 

22 whole or in part fi-om special purpose hardware or a general purpose computer system, or any 

23 combination thereof, any portion of which may be controlled by a suitable program. Any program 

24 may in whole or in part comprise part of or be stored on the system in a conventional manner, or it 

25 may in whole or in part be provided in to the system over a network or other mechanism for 

26 transferring information in a conventional manner. In addition, it will be appreciated that the system 
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may be operated and/or otherwise controlled by means of information provided by an operator using 
operator input elements (not shown) which may be connected directly to the system or which may 
transfer the information to the system over a network or other mechanism for transferring information 
in a conventional manner. 

The foregoing description has been limited to a specific embodiment of this invention. It will 
be apparent, however, that various variations and modifications may be made to the invention, with 
the attainment of some or all of the advantages of the invention. It is the object of the appended 
claims to cover these and such other variations and modifications as come within the true spirit and 
scope of the invention. 

What is claimed as new and desired to be secured by Letters Patent of the United States is: 
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Claims 



1 1 . A system for generating an operational assessment of a cache memory in a digital data processing 

2 system for respective cache memory sizes comprising: 

3 A. an operational statistics gathering element for gathering operational statistics over a time 

4 interval, including a file information retrieval activity value and a extent of activity value for 

5 each file accessed during the time interval; 

6 B. a cache miss prediction element for generating a cache miss prediction value in response to 
|f| 7 the operational statistics gathered by the operational statistics gathering element and a cache 

8 memory size value; and 

%^ 
W 

9 C. a cache memory size adjustment element for adjusting the cache memory size in response to 
I ilO the cache memory size vahie generated by the cache miss prediction element for a selected 
J^l 1 one of said cache miss prediction values. 

f ; !?? 

J ^ 1 2. A system as defined in claim 1 in which the cache miss prediction element generates the cache miss 

2 prediction value based on a particular one of a plurality of cache memory management 

3 methodologies. 

1 3. A system as defined in claim 2 in which one of said cache memory management methodologies is 

2 a FIFO (first-in/first-out) methodology, the cache miss prediction element generating the cache miss 

3 prediction value in accordance with: 
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r SA 

M + — 



where "M" represents the cache miss prediction value, "S" represents the selected cache memory size 
value, "Ai" represents the file retrieval activity value for a file "i," and "E;" represents the extent of 
activity value for the file "i." 

4. A system as defined in claim 3 in which the cache miss prediction element determines the cache 
miss prediction value "M" using a binary search methodology over the interval 

0 < M < = A 



where "A" represents the total activity over the time interval 

5. A system as defined in claim 2 in which one of said cache memory management methodologies is 
a FIFO (first-in/first-out) methodology, the operational statistics gathering element fiirther gathering 
a packet re-reference value indicating a number of times a portion of a file, identified as a packet, is 
referenced during the time interval, the cache miss prediction element generating the cache miss 
prediction value in accordance with: 



96-108 



-26- 



-1 -H(i,S)) 



M + 



where "M" represents the cache miss prediction value, "S" represents the selected cache memory size 
value, "Aj" represents the file retrieval activity value for a file "i," "Ej" represents the extent of activity 
value for the file "i," and " represents the packet re-reference value for file "i." 



6, A system as defined in claim 5 in which the cache miss prediction element determines the cache 
miss prediction value "M" using a binary search methodology over the interval 

0 < M < Y^A. = A 



where "A" represents the total activity over the time interval 



7. A system as defined in claim 2 in which one of said cache memory management methodologies is 
an LRU (least-recently used) methodology, the operational statistics gathering element fiirther 
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gathering a packet re-reference value indicating a number of times a portion of a file, identified as a 
packet, is referenced during the time interval, the cache miss prediction element generating the cache 
mis prediction value in accordance with: 



M + 



SI + 




f A^] 


M + 


V • • J 







where "M" represents the cache miss prediction value, "S" represents the selected cache memory size 
value, "Ai" represents the file retrieval activity value for a file "i," "E/' represents the extent of activity 
value for the file "i," "Ri " represents the packet re-reference value for file "i," and "I" represents the 
duration of the time interval. 



8. A system as defined in claim 7 in which the cache miss prediction element determines the cache 
miss prediction value "M" using a binary search methodology over the interval 

0 <M < X) A = A 



where "A" represents the total activity over the time interval. 
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9. A method for generating an operational assessment of a cache memory in a digital data processing 
system for respective cache memory sizes comprising the steps of 

A. gathering operational statistics over a time interval, including a file information retrieval 
activity value and a extent of activity value for each file accessed during the time interval; 

B. generating a cache miss prediction value in response to the operational statistics gathered 
during the operational statistics gathering step, and a cache memory size value; and 

C. adjusting the cache memory size in response to the cache memory size value generated during 
the cache miss prediction step for a selected one of said cache miss prediction values. 

10. A method as defined in claim 9 in which during the cache miss prediction step the cache miss 
prediction value based on a particular one of a plurality of cache memory management 
methodologies. 

1 1. A method as defined in claim 10 in which one of said cache memory management methodologies 
is a FIFO (first-in/first-out) methodology, during the cache miss prediction step the cache miss 
prediction value being generated in accordance with: 
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where "M" represents the cache miss prediction value, "S" represents the selected cache memory size 
value, "Aj" represents the file retrieval activity value for a file "i," and "E;" represents the extent of 
activity value for the file "i." 

12. A method as defined in claim 1 1 in which, during the cache miss prediction step, the cache miss 
prediction value "M" being generated using a binary search methodology over the interval 

0 < M ^ = A 

i 

where "A" represents the total activity over the time interval. 

13. A method as defined in claim 10 in v^hich one of said cache memory management methodologies 
is a FIFO (first-in/first-out) methodology, the operational statistics gathering element further 
gathering a packet re-reference value indicating a number of times a portion of a file, identified as a 
packet, is referenced during the time interval, during the cache miss prediction step the cache miss 
prediction value being generated in accordance with: 
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where "M" represents the cache miss prediction value, "S" represents the selected cache memory size 
value, "Ai" represents the file retrieval activity value for a file "i," "R" represents the extent of activity 
value for the file "i," and "Rj " represents the packet re-reference value for file "i." 

14. A method as defined in claim 13 in which, during cache miss prediction step, the cache miss 
prediction value "M" being generated using a binary search methodology over the interval 



where "A" represents the total activity over the time interval. 

15. A method as defined in claim 10 in which one of said cache memory management methodologies 
is an LRU (least-recently used) methodology, during the operational statistics gathering step a packet 
re-reference value being further gathered indicating a number of times a portion of a file, identified 
as a packet, is referenced during the time interval, the cache miss prediction element generating the 
cache mis prediction value in accordance with: 




M + 



(A. SI] 



R. 



R. 
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where "M" represents the cache miss prediction value, "S" represents the selected cache memory size 
value, "Aj" represents the file retrieval activity value for a file "i," represents the extent of activity 
value for the file "i," "Rj " represents the packet re-reference value for file "i," and "I" represents the 
duration of the time interval. 

16. A method as defined in claim 15 in which during the cache miss prediction step the cache miss 
prediction value "M" being generated using a binary search methodology over the interval 

0 < M < 52^, = A 



where "A" represents the total activity over the time interval. 
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Abstract Of The Disclosure 



A system efficiently determines of the likely effectiveness of the cache memory for various 
cache memory sizes, based on a prediction of the likely cache miss rate, the prediction being based 
on operational statistics which are gathered during actual use of the cache memory over one or more 
time periods, and based on a variety of cache management methodologies. Based on the prediction, 
the operator or the system can facilitate increasing or decreasing the size of the cache memory, or 
maintaining the cache memory at its then-current size. The system determines the cache memory's 
read miss rate from statistics that are collected during use of the cache memory over an arbitrary time 
interval, including statistics concerning the file information retrieval activity and the extent of activity 
per unit time for system. Based on the statistics, equations, which are based on the respective cache 
memory management methodology, including the FIFO (first-in/first-out) methodology or the LRU 
(least-recently used) methodology, used in managing the cache memory, are solved to generate a 
prediction of the cache miss rate for a particular cache memory size, which may be larger or smaller 
than the current cache memory size, and for the particular cache memory management methodology. 
The system can repeat this a number of times over respective time intervals to determine 
corresponding predictions based on the cache memory utilization for respective sets of statistics 
determined during each time interval Thereafter, the system or an operator can effect a change in 
the cache memory size based on the cache miss rate predictions. 



/^00. CACHE ASSESSMENT SYSTEM 10 
COLLECTS THE OPERATIONAL STATISTICS 
DURING OPERATION OF THE DIGITAL DATA 
PROCESSING SYSTEM 14 OVER A SELECTED 
TIME PERIOD 



01. CACHE ASSESSMENT SYSTEM 10 APPLIES^ 
THE STATISTICS TO THE APPROPRIATE 
EQUATION DEPENDING ON THE SELECTED 
MODEL, FOR VARIOUS SIZES "S" OF CACHE 
MEMORY 31 TO GENERATE RESPECTIVE 
PREDICTIONS AS TO THE NUMBER OF READ 
MISSES FOR EACH OF THE RESPECTIVE CACHE 
\^ES "S" 



102. BASED ON THE RESPECTIVE 
PREDICTIONS, THE CACHE ASSESSMENT 
SYSTEM 10 OR THE OPERATOR CAN 
DETERMINE WHETHER TO ADJUST THE SIZE OF 
THE CACHE MEMORY 31 AND, IF SO, INITIATE 
OPERATIONS TO ENABLE THE SIZE 
ADJUSTMENT TO OCCUR. 
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